Short Term Diachronic Shifts in Part-of-Speech Frequencies: A Comparison of the Tagged LOB and F-LOB Corpora

نویسندگان

  • Christian Mair
  • Marianne Hundt
  • Geoffrey Leech
  • Nicholas Smith
چکیده

"Our Western civilization, it has been said, favours an overdevelopment of the intellect at the expense of the emotions. This is why people prefer nouns to verbs." (Potter 1975: 101) 1. Introduction In the present paper we do not aim at the type of language-based cultural criticism encapsulated in the motto, which is taken from Potter's Changing English – an interesting if somewhat impressionistic account of linguistic change in present-day English written for a popular audience. Yet the quote serves as a reminder that part-of-speech frequencies in texts are far from trivial and may indeed be revealing stylistic indicators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Preparation and Analysis of Linguistic Corpora

The corpus is a fundamental tool for any type of research on language. The availability of computers in the 1950’s immediately led to the creation of corpora in electronic form that could be searched automatically for a variety of language features and compute frequency, distributional characteristics, and other descriptive statistics. Corpora of literary works were compiled to enable stylistic...

متن کامل

Diachronic Variation in Grammatical Relations

We present a method of finding and analyzing shifts in grammatical relations found in diachronic corpora. Inspired by the econometric technique of measuring return and volatility instead of relative frequencies, we propose them as a way to better characterize changes in grammatical patterns like nominalization, modification and comparison. To exemplify the use of these techniques, we examine a ...

متن کامل

Comparison of Fluoroplastic Causse Loop Piston and Titanium Soft-Clip in Stapedotomy

Introduction:Different types of prosthesis are available for stapes replacement. Because there has been no published report on the efficacy of the titanium soft-clip vs the fluoroplastic Causse loop Teflon piston, we compared short-term hearing results of both types of prosthesis in patients who underwent stapedotomy due to otosclerosis.Materials and Methods:A total of 57 ears were included in ...

متن کامل

Part-of-Speech Tagging from "Small" Data Sets

Probabilistic approaches to part-of-speech (POS) tagging compile statistics from massive corpora such as the Lancaster-Oslo-Bergen (LOB) corpus. Training on a 900,000 token training corpus, the hidden Markov model (HMM) method easily achieves a 95 per cent success rate on a 100,000 token test corpus. However, even such large corpora contain relatively few words and new words are subsequently en...

متن کامل

Weaving web data into a diachronic corpus patchwork

This paper offers a reassessment of the role of web data in diachronic linguistic analysis. We introduce the diachronic search facilities provided by the WebCorp Linguist’s Search Engine, including the use of a new ‘heat map’ graph for the analysis of changes in collocational patterns over time. We illustrate how web data can be used to supplement data from standard corpora in lexicological stu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008